Picture for Songyou Peng

Songyou Peng

WorldMemArena: Evaluating Multimodal Agent Memory Through Action-World Interaction

Add code
May 28, 2026
Viaarxiv icon

Sensor2Sensor: Cross-Embodiment Sensor Conversion for Autonomous Driving

Add code
May 21, 2026
Viaarxiv icon

GeoFlow: Enforcing Implicit Geometric Consistency in Video Generation

Add code
May 18, 2026
Viaarxiv icon

Image Generators are Generalist Vision Learners

Add code
Apr 22, 2026
Viaarxiv icon

CityRAG: Stepping Into a City via Spatially-Grounded Video Generation

Add code
Apr 21, 2026
Viaarxiv icon

Feed-Forward 3D Scene Modeling: A Problem-Driven Perspective

Add code
Apr 15, 2026
Viaarxiv icon

UFO-4D: Unposed Feedforward 4D Reconstruction from Two Images

Add code
Mar 05, 2026
Viaarxiv icon

Selfi: Self Improving Reconstruction Engine via 3D Geometric Feature Alignment

Add code
Dec 21, 2025
Viaarxiv icon

Splat4D: Diffusion-Enhanced 4D Gaussian Splatting for Temporally and Spatially Consistent Content Creation

Add code
Aug 11, 2025
Viaarxiv icon

LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering

Add code
May 29, 2025
Figure 1 for LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering
Figure 2 for LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering
Figure 3 for LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering
Figure 4 for LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering
Viaarxiv icon